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Abstract 

Consider a random graph model where each possible edge e is present independently with some 
probability p e . Given these probabilities, we want to build a large/heavy matching in the randomly 
generated graph. However, the only way we can find out whether an edge is present or not is to query 
it, and if the edge is indeed present in the graph, we are forced to add it to our matching. Further, 
each vertex i is allowed to be queried at most i, times. How should we adoptively query the edges to 
maximize the expected weight of the matching? We consider several matching problems in this general 
framework (some of which arise in kidney exchanges and online dating, and others arise in modeling 
online advertisements); we give LP-rounding based constant-factor approximation algorithms for these 
problems. Our main results are the following: 

• We give a 4 approximation for weighted stochastic matching on general graphs, and a 3 approxi- 
mation on bipartite graphs. This answers an open question from [Chen et al. ICALP 09]. 

• Combining our LP-rounding algorithm with the natural greedy algorithm, we give an improved 
3.46 approximation for unweighted stochastic matching on general graphs. 

• We introduce a generalization of the stochastic online matching problem [Feldman et al. FOCS 09] 
that also models preference-uncertainty and timeouts of buyers, and give a constant factor approx- 
imation algorithm. 

1 Introduction 

Motivated by applications in kidney exchanges and online dating, Chen et al. 13 proposed the following 
stochastic matching problem: we want to find a maximum matching in a random graph G on n nodes, 
where each edge (i , j) £ [Qj] exists with probability pij, independently of the other edges. However, all we 
are given are the probability values {pij}. To find out whether the random graph G has the edge (i , j) or not, 
we have to try to add the edge (i, j) to our current matching (assuming that i and j are both unmatched in 
our current partial matching) — we call this "probing" edge (i, j). As a result of the probe, we also find out 
if (i, j) exists or not — and if the edge (i, j) indeed exists in the random graph G, it gets irrevocably added 
to M. Such policies make sense, e.g., for dating agencies, where the only way to find out if two people 
are actually compatible is to send them on a date; moreover, if they do turn out to be compatible, then it 
makes sense to match them to each other. Finally, to model the fact that there might be a limit on the number 
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of unsuccessful dates a person might be willing to participate in, "timeouts" on vertices are also provided. 
More precisely, valid policies are allowed, for each vertex i, to only probe at most t$ edges incident to i. 
Similar considerations arise in kidney exchanges, details of which appear in Q. 

Chen et al. asked the question: how can we devise probing policies to maximize the expected cardinality 
(or weight) of the matching? They showed that the greedy algorithm that probes edges in decreasing order 
of pij (as long as their endpoints had not timed out) was a 4-approximation to the cardinality version of 
the stochastic matching problem. This greedy algorithm (and other simple greedy schemes) can be seen to 
be arbitrarily bad in the presence of weights, and they left open the question of obtaining good algorithms 
to maximize the expected weight of the matching produced. In addition to being a natural generalization, 
weights can be used as a proxy for revenue generated in matchmaking services. (The unweighted case can 
be thought of as maximizing the social welfare.) In this paper, we resolve the main open question from 
Chen etal. Q: 

Theorem 1 There is a 4-approximation algorithm for the weighted stochastic matching problem. For bi- 
partite graphs, there is a ^-approximation algorithm. 

Our main idea is to use the knowledge of edge probabilities to solve a linear program where each edge 
e has a variable < y e < 1 corresponding to the probability that a strategy probes e (over all possible 
realizations of the graph). This is similar to the approach for stochastic packing problems considered by 
Dean et al. EE]. We then give two different rounding procedures to attain the bounds claimed above. 

• The first algorithm ( £12.11 ) is very simple: it considers edges in a uniformly random order and probes 
each edge e with probability proportional to y e ; the analysis uses Markov's inequality and a Chernoff- 
type bound (Lemma [2]). 

• The second algorithm ( ^2.21 ) is more nuanced and achieves a better bound: we use dependent round- 
ing ifTTTl on the y-values to obtain a set E of edges to be probed, and then probe edges of E in a 
uniformly random order. 

Though the first algorithm has a weaker approximation ratio, we still present it since it is useful in the online 
stochastic matching problem (Section [3]). 

The second rounding algorithm has an additional advantage: The probing strategy returned by the algo- 
rithm can be made matching-probing 0. In this alternative (more restrictive) probing model we are given 
an additional parameter k and edges need to be probed in k rounds, each round being a matching. It is 
clear that this matching-probing model is more restrictive than the usual edge-probing model (with timeouts 
minjij, k}) where one edge is probed at a time. Our algorithm obtains a matching-probing strategy that is 
only a small constant factor worse than the optimal edge-probing strategy; hence, we also obtain the same 
constant approximation guarantee for weighted stochastic matching in the matching-probing model. It is 
worth noting that previously only a logarithmic approximation in the unweighted case was known Q. 

Theorem 2 There is a 4-approximation algorithm for the weighted stochastic matching problem in the 
matching-probing model. For bipartite graphs, there is a ^-approximation algorithm. 

Notice that for general graphs our algorithm matches the performance of the greedy algorithm shown by 
Chen et al. Q for the unweighted case. Interestingly, even though their individual analyses show that they 
are 4-approximations, they can be combined to obtain better approximations. 

Theorem 3 There is a 3A6-approximation algorithm for the unweighted stochastic matching problem in 
general graphs. 

Apart from solving these open problems and yielding improved approximations, our LP-based analysis 
turns out to be applicable in a wider context. 
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Online Stochastic Matching Revisited. In a bipartite graph (A, B; E) of items i G A and potential buyer 
types j G B, pij denotes the probability that a buyer of type j will buy item i. A sequence of n buyers are 
to arrive online, where the type of each buyer is an i.i.d. sample from B according to some pre-specified 
distribution — when a buyer of type j appears, he can be shown a list L of up to tj as-yet-unsold items, and 
the buyer buys the first item on the list according to the given probabilities p.j. (Note that with probability 
[3 ieL (l — pij), the buyer leaves without buying anything.) What items should we show buyers when they 
arrive online, and in which order, to maximize the expected weight of the matching? Building on the 
algorithm for stochastic matching in ^2.31 we prove the following in Section [3] 

Theorem 4 There is a 7 ' .^2-approximation algorithm for the above online stochastic matching problem. 

This question is an extension of similar online stochastic matching questions considered earlier in ifTOll — 
in that paper, Wij,pij 6 {0, 1} and tj = 1. Our model tries to capture the facts that buyers may have a 
limited attention span (using the timeouts), they might have uncertainties in their preferences (using edge 
probabilities), and that they might buy the first item they like rather than scanning the entire list. 

A New Proof for Greedy. The proof in Q that the greedy algorithm for stochastic matching was a 4- 
approximation in the unweighted case was based on a somewhat delicate charging scheme involving the 
decision trees of the algorithm and the optimal solution. We show (Appendix |B]) that the greedy algorithm, 
which was denned without reference to any LPs, admits a simple LP-based analysis. 

Theorem 5 The greedy algorithm is a ^-approximation for the unweighted stochastic matching problem. 

Cardinality Constrained Matching in Rounds. We also consider the model from where one can 
probe as many as C edges in parallel, as long as these C edges form a matching; the goal is to maximize 
the expected weight of the matched edges after k rounds of such probes. We improve on the minjfc, C}- 
approximation offered in Q (which only works for the unweighted version), and show in Appendix |Aj 

Theorem 6 There is a constant-factor approximation algorithm for weighted cardinality constrained multiple- 
round stochastic matching. 

Extension to Hypergraphs. We extend our analysis to a much more general situation where we try to pack 
A;-hyperedges with random sizes into a c?-dimensional knapsack of a given size; this is just the stochastic 
knapsack problem of (H, but where we consider the situation where k <C d. For this setting of parameters, 
we improve on the \/d-approximation of [8] to prove the following (Section |4|. 

Theorem 7 There is a Ik-approximation algorithm for the weighted stochastic k-set-packing problem. 

We note that the stochastic A;-set-packing problem is a direct generalization of the stochastic matching 
problem; so an 8-approximation for stochastic matching follows from Theorem [7] However, using more 
structure in the matching problem, we could obtain the better approximation ratios in Theorems Q] and [2] 
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1.1 Related Work. 



As mentioned above, perhaps the work most directly related to this work is that on stochastic knapsack 
problems (Dean et al. @[8|]) and multi-armed bandits (see |[T3l[T4l and references therein). Also related is 
some recent work (4[ on budget constrained auctions, which uses similar - LP rounding ideas. 

In recent years stochastic optimization problems have drawn much attention from the theoretical com- 
puter science community where stochastic versions of several classical combinatorial optimization problems 
have been studied. Some general techniques have also been developed |[T5ll23l . See [24] for a survey. 

The online bipartite matching problem was first studied in the seminal paper by Kaip et al. lfT7l and an 
optimal 1 — 1/e competitive online algorithm was obtained. Katriel et al. |[T8l considered the two-stage 
stochastic min-cost matching problem. In their model, we are given in a first stage probabilistic information 
about the graph and the cost of the edges is low; in a second stage, the actual graph is revealed but the costs 
are higher. The original online stochastic matching problem was studied recently by Feldman et al. ifTOll . 
They gave a 0.67-competitive algorithm, beating the optimal 1 — 1/e-competitiveness known for worst-case 
models ITT71 [T6l |2T1 [5] [El- Our model differs from that in having a bound on the number of items each 
incoming buyer sees, that each edge is only present with some probability, and that the buyer scans the list 
linearly (until she times out) and buys the first item she likes. Recently, some improved bounds on this 
model were obtained I2ll20l. 

Our problem is also related to the Adwords problem OTTl . which has applications to sponsored search 
auctions. The problem can be modeled as a bipartite matching problem as follows. We want to assign every 
vertex (a query word) on one side to a vertex (a bidder) on the other side. Each edge has a weight, and there 
is a budget on each bidder representing the upper bound on the total weight of edges that may be assigned 
to it. The objective is to maximize the total revenue. The stochastic version in which query words arrive 
according to some known probability distribution has also been studied |[T9l . 

1.2 Preliminaries. 

For any integer m > 1, define [m] to be the set { 1 , . . . , m}. For a maximization problem, an a-approximation 
algorithm is one that computes a solution with expected objective value at least 1/a times the expected value 
of the optimal solution. 

We must clarify here the notion of an optimal solution. In standard worst case analysis we would 
compare our solution against the optimal offline solution, e.g. the value of the maximum matching, where 
the offline knows all the edge instantiations in advance (i.e. which edge will appear when probed, and 
which will not). However, it can be easily verified that due to the presence of timeouts, this adversary is 
too strong Q. Consider the following example. Suppose we have a star where each vertex has timeout 1, 
and each edge has pij = 1/n. The offline optimum can match an edge whenever the star has an edge i.e. 
with probability about 1 — 1/e, while our algorithm can only get expected 1/n profit, as it can only probe a 
single edge. Hence, for all problems in this paper we consider the setting where even the optimum does not 
know the exact instantiation of an edge until it is probed. This gives our algorithms a level playing field. The 
optimum thus corresponds to a "strategy" of probing the edges, which can be chosen from an exponentially 
large space of potentially adaptive strategies. 

We note that our algorithms in fact yield non-adaptive strategies for the corresponding problems, that 
are only constant factor worse than the adaptive optimum. This is similar to previous results on stochastic 
packing problems: knapsack (Dean et al. [H) and multi-armed bandits (Guha-Munagala |[T3l [141 and 
references therein). 



4 



2 Stochastic Matching 



We consider the following stochastic matching problem. The input is an undirected graph G = (V,E) with 
a weight w e and a probability value p e on each edge e G £\ In addition, there is an integer value t v for each 
vertex v £ V (called patience parameter). Initially, each vertex v G V has patience t v . At each step in the 
algorithm, any edge e(u, v) such that u and v have positive remaining patience can be probed. Upon probing 
edge e, one of the following happens: (1) with probability p e , vertices u and v get matched and are removed 
from the graph (along with all adjacent edges), or (2) with probability 1 — p e , the edge e is removed and the 
remaining patience numbers of u and v get reduced by 1. An algorithm is an adaptive strategy for probing 
edges: its performance is measured by the expected weight of matched edges. The unweighted stochastic 
matching problem is the special case when all edge-weights are uniform. 

Consider the following linear program: as usual, for any vertex v G V, d(v) denotes the edges incident 
to v. Variable y e denotes the probability that edge e = (u, v) gets probed in the adaptive strategy, and 
x e = Pe • He denotes the probability that u and v get matched in the strategy. (This LP is similar to the LP 
used for general stochastic packing problems by Dean, Goemans and Vondrak ll8l.) 

maximize w e ■ x e (LP1) 

eeE 

subject to 

x e < 1 Vv G V (1) 

e£d(v) 

Ve<U Vv G V (2) 

Xe — Pe' He Ve G E (3) 

< y e < 1 Vee£ (4) 

The following claim shows that the LP above is a valid relaxation for the stochastic matching problem. 

Claim 1 The optimal value for LP (|LP1I) is an upper bound on any (adaptive) algorithm for stochastic 
matching. 

Proof: To show this, it suffices to show that any adaptive strategy satisfies the constraints of the LP. Con- 
ditioned on any instantiation of all edges in E (i.e. each edge e G E is present with probability p e ), the 
expected number of probes involving any vertex v G V is at most t v (the patience parameter). Similarly 
conditioning on edges E, the expected number of matched edges involving v G V is at most 1. Hence these 
constraints hold unconditionally as well, which implies that any valid strategy satisfies CD) and (0. ■ 

2.1 Weighted Stochastic Matching: General Graphs 

Our algorithm first solves (|LP1I) to optimality and uses the optimal solution (x, y) to obtain a non-adaptive 
strategy achieving expected value £7(1) - (w -x). Next, we present the algorithm. Let (x, y) denote an optimal 
solution to the above LP, which by Claim Q] gives an upper-bound on any adaptive strategy. Let a > 1 be 
a constant to be set later. The algorithm first fixes a uniformly random permutation ir on edges E. It then 
inspects edges in the order of n, and probes only a subset of the edges. A vertex v G V is said to have timed 
out if t v edges incident to v have already been probed (i.e. its remaining patience reduces to 0); and vertex 
v is said to be matched if it has already been matched to another vertex. An edge (tt, v) is called safe at the 
time it is considered if (A) neither u nor v is matched, and (B) neither u nor v has timed out. The algorithm 
is the following: 
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1. Pick a permutation 7r on edges E uniformly at random 

2. For each edge e in the ordering 7r, do: 

a. If e is safe then probe it with probability y e /a, else do not probe it. 



In the rest of this section, we prove that this algorithm achieves a 5.75-approximation for the weighted 
stochastic matching problem. Even though this is slightly worse that the approximation factors claimed in 
Theorem [T] this first algorithm is significantly simpler, it readily illustrates the power of the LP approach, 
and, as we shall see in § HI it can handle a much more general version of the basic problem. 

We begin with the following property: 

Lemma 1 For any edge (u, v) E E, at the point when (u, v) is considered under it, 

(a) the probability that vertex u has timed out is at most and 

(b) the probability that vertex u is matched is at most 

Proof: We begin with the proof of part (a). Let random variable U denote the number of probes incident to 
vertex u by the time edge (u, v ) is considered in n. 

E[U}= Pledge e appears before (u, v) in n AND e is probed ] , 

eed(u) 

< Prfedge e appears before (u, v) in n] ■ — , 

EVe_ 
2a' 

e&d(u) 

~ 2a' 

The first inequality above follows from the fact that the probability that edge e is probed (conditioned on 
7r) is at most y e /a. The second equality follows since it is a u.a.r. permutation on E. The last inequality 
is by the LP constraint (f2]). The probability that vertex u has timed out when (u, v) is considered equals 
Pr[ U > t u ] < < ^ , by the Markov inequality. This proves part (a). The proof of part (b) is identical 
(where we consider the event that an edge is matched instead of being probed and replace y e and t u by x e 
and 1 respectively and use the LP constraint £jQ)) and is omitted. ■ 

Now, a vertex u G V is called low -timeout iit u = 1, else u is called a high-timeout vertex if t u > 2. We 
next prove the following bound for high-timeout vertices that is stronger than the one from LemmaQJa). 



Lemma 2 Suppose a > e. For a high-timeout vertex u G V, and any edge f incident to u, the probability 

2 

3c?" 



that u has timed out when f is considered in ir is at most 



Proof: Let t = t u > 2 denote the patience parameter for vertex u, and F = d(u) \ {/} the set of edges 
incident to u excluding /. Then the probability that u has timed out when / is considered under ir is upper 
bounded by: 

y~] Pr [ edges pi, - ■ ■ ,pt appear before / in it AND are all probed ] , (5) 

{Pir-',Pi}CF 

< — ■ ^ Pr [ edges p\ , ■ ■ ■ ,pt appear before / in tt AND are all probed ] , (6) 

Pi,— ,Pt&F 
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< — ■ ^2 P r [ edges pi, ■■■ ,p t appear before / in it ]• — , (7) 

pi,---,pteF t=\ 

(8) 




(9) 

zF ) 

- TtTTy. I ~V- 00) 

In the above, the summation in © is over unordered t-tuples whereas the subsequent ones ©-([81) are over 
ordered tuples (with repetition). Inequality © uses the fact that for any edge g, the probability of probing 
g conditioned on it and the outcomes until g is considered, is at most y g /a (and the fact that the probability 
of probing an edge is independent of the probability of probing other edges). Equation ([8]) follows from the 
fact that probability that / is the last to appear among {p\, ■ ■ ■ ,p t , /} in a random permutation it is 
Finally, (flOl) follows from the LP constraint © at it. 

Let f(t) := u+u\ ■ (^) • We claim that f(t) < when a > e and t > 2, which would prove the 
claim. Note that this is indeed true for t = 2 (in fact with equality). Also f(t + 1) < f(t) for all t > 2 due 
to: 

/(t+i) _ ft + iy t + i i < e < l 



/(*) v * / * + 2 « ~ a 

Thus we obtain the desired upper bound. ■ 
Using this, we can analyze the probability that an edge is safe. 

Lemma 3 For a > e, an edge f = (u, v) is safe with probability at least (1 — — — when f is considered 
under a random permutation ir. 

Proof: The analysis proceeds by considering the following cases. 

1. Both u and v are low-timeout. Since t u = t v = 1, the event that u (resp. v) is matched at any point is 
a subset of the event that u (resp. v) has timed out. Thus by Lemma [H the probability that edge / is 
not safe (when it is considered) is < 

2. Both u and v are high-timeout. Lemma [2] implies that the probability that u (resp. v) has timed out is 
at most ^j-. Again by Lemma [TJ the probability that u (resp. v) is matched is at most Thus the 
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probability that / is not safe is at most — + 



3. u is low-timeout and v is high-timeout. Using the argument in Step (Q} for vertex u, the probability 

Y 



that vertex u has timed out or matched is at most J^. And using Step © for vertex v, the probability 
that vertex v has timed out or matched is at most ^ + So the probability that edge (u, v) is not 



safe is at most - + 

a 3a 



Hence every edge is safe (when considered in it) with probability > (1 — ^ — -^i)- ■ 

Theorem 8 Setting a = 1 + y/5 in the above algorithm gives an b.lh-approximation for the weighted 
stochastic matching problem. 
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Proof : Theorem Q] Given that an edge e G E is safe when considered, the expected profit for the algorithm 
is w e ■ p e *j = io e • a; e /a. Now using LemmaO the algorithm gets expected profit at least — ^7 — 7^3-) 

times the optimal LP value. Plugging in a = 1 + \/5 gives an approximation ratio of ~[^^^ < 5.75, as 
desired. ■ 



2.2 Weighted Stochastic Matching: Bipartite Graphs 

In this section, we obtain an improved bound for stochastic matching on bipartite graphs via a different 
rounding procedure. In fact, the algorithm produces a matching -probing strategy whose expected value is a 
constant fraction of the optimal value of (ILP1I) (which was for edge-probing). 

Algorithm. First, we find an optimal fractional solution (x, y) to (1LP1I) and round y to identify a set of 
interesting edges E. Then we use Konig's Theorem 11221 Ch. 20] to partition E into a small collection of 
matchings M\ , . . . , M^. Finally, these matchings are then probed in random order. If we are only interested 
in edge-probing strategies, probing the edges in E in random order would suffice. We will refer to this 
algorithm as ROUND-COLOR-PROBE: 

1. (x, y) <— optimal solution to (1LP1I) 

2. y <— round y to an integral solution using GKSP 

3. E <- {e G E : y e = 1} 

4. Mi, . . . , Mh <— optimal edge coloring of E 

5. For each M in {Mi , . . . , M^} in random order, do: 

a. probe {(u, v) G M : u and v are unmatched} 

Besides the edge coloring step, the key difference from the algorithm of the previous subsection is in the 
choice of E. For this we use the GKSP procedure of Gandhi et al. iPTTll . which we describe next. 

The GKSP algorithm. We state some properties of the dependent rounding framework of Gandhi et al. ifTTI 
that are relevant in our context. 

Theorem 9 ( 1111 ) Let {A, B; E) be a bipartite graph and z e G [0, 1] be fractional values for each edge 
e G E. The GKSP algorithm is a polynomial-time randomized procedure that outputs values Z e G {0,1} 
for each e G E such that the following properties hold: 

PI. Marginal distribution. For every edge e, Pr[Z e = 1] = z e . 

P2. Degree preservation. For every vertex u G A U B, ^ e e9(«) — TSeeiSu Ze \ ' 

P3. Negative correlation. For any vertex u and any set of edges S C d(u): 

Pr[/\(Z e = l)]<IJPr[^e = l]. 

eeS eeS 

We note that the GKSP algorithm in fact guarantees stronger properties than the ones stated above. For 
the purpose of analyzing ROUND-COLOR-PROBE, however, the properties stated above will suffice. 
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Feasibility. Let us first argue that our algorithm outputs a feasible strategy. If we care about feasibility in 
the edge-probing model, we only need to show that each vertex u is not probed more than t u times. The 
following lemma shows that: 

Lemma 4 For every vertex u, ROUND-COLOR-PROBE probes at most t u edges incident on u. 

Proof: Vertex u is matched in |{e £ e?g(u)}| matchings. This is an upper bound on the number of times 
edges incident on u probed. Hence we just need to show that this quantity is at most t u . Indeed, 



\{eed 2 (u)}\ = £ y e < \ £ Ve 



— tui 



where the first inequality follows from the degree preservation property of Theorem [9] and the second in- 
equality from the fact that y is a feasible solution to (1LP1|) . ■ 

Let us argue that the strategy is also feasible under the matching-probing model. Recall that in the latter 
model we are given an additional parameter k (which without loss of generality we can assume to be at 
most max„ 6 y t u ) and we can probe edges in k round, with each round forming a matching. Let E be the 
set of edges in the support of y, i.e., E = {e £ E \ y e = 1}. Let h = max„ g y deg^(f ) < max„ g y t v . 
Konig's Theorem allows us to decomposed E into h matchings. Therefore, the probing strategy devised by 
the algorithm is also feasible in the matching-probing model. 

Performance guarantee. Let us focus our attention on some edge e = (u, v ) 6 E. Our goal is to show 
that there is good chance that the algorithm will indeed probe e. We first analyze the probability of e being 
probed conditional on E. Notice that the algorithm will probe e if and only if all previous probes incident 
on u and v were unsuccessful; otherwise, if there was a successful probe incident on u or v, we say that e 
was blocked. 

Let 7T be a permutation of the matchings M\, . . . , Mh- We extend this ordering to the set E by listing the 
edges within a matching in some arbitrary but fixed order. Let us denote by B(e, n) C E the set of edges 
incident on u or v that appear before e in it. It is not hard to see that 



Pr [ e was not blocked | ^J^E^^ 0- ~ Pf) I E 

/6B(e,7r) 



(11) 



here we assume that ri/e_B(e w)(^ ~ Pf) = ^ wn en B(e, it) = 0. 

Notice that in (fTTTt we only care about the order of edges incident on u and v. Furthermore, the expec- 
tation does not range over all possible orderings of these edges, but only those that are consistent with some 
matching permutation. We call this type of restricted ordering random matching ordering and we denote it 
by 7r; similarly, we call an unrestricted ordering random edge ordering and we denote it by a. Our plan is 
to study first the expectation in (fTTTt over random edge orderings and then to show that the expectation can 
only increase when restricted to range over random matching orderings. 

The following simple lemma is useful in several places. 

Lemma 5 Let r and p ma x be positive real values. Consider the problem of minimizing nf =1 (l — Pi) 
subject to the constraints ^4=1 Pi — r an d < p% < Pmax for i = 1, . . . , t. Denote the minimum value by 
r](r,p ma _ x ). Then, 

7?(r,Pmax) = (1 - Pmax)!- 55 ™-! ( 1 ~ (r ~ ^~ Pmax)) > (1 - Pmax) r/?W • 

\ /-'111 3.X / 
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Proof: Suppose the contrary that the quantity is minimized but there are two p{S that are strictly between 
and p m ax- W.l.o.g, they are pi,p2 and p\ > p2 Let e = min(p max — Pi,P2)- It is easy to see that 

t t t 

(1 - (p! + e))(l - (pa - e)) JJ(1 " Pi) " 1(1 " Pi) = £ (P2 - Pi - e) JJ(1 - Pi ) < 0. 



i=3 



i=l 



i=3 



This contradicts the fact the quantity is minimized. Hence, there is at most one pi which is strictly between 
and p max . 

The last inequality holds since 1 - b > (1 - a) b / a for any < b < a < 1. ■ 
Let c^(e) be the set of edges in i£ incident on either endpoint of e excluding e itself. 

Lemma 6 Le? e an edge in E and let a be a random edge ordering. Let p max = max^ £ ^ pf. Assume 
that T,fea p (e)Pf = r - Then > 



E, 



{ n (i-p/)i^]>/ 



rj(xr, xp max ) dx. 



feB(e,a) 

Proof: We claim that the expectation can be written in the following continuous form: 



E r 



n (i-p/) i e\ = f n (! 



xp/) dx. 



f&B(e,<7 



/e%(e) 



The lemma easily follows from this and Lemma [5] 

To see the claim, we consider the following random experiment: For each edge / G d(e), we pick 
uniformly at random a real number aj in [0,1]. The edges are then sorted according to these numbers. It 
is not difficult to see that the experiment produces uniformly random orderings. For each edge /, let the 
random variable Af = 1 — p/ if / G B(e, a) and Af = 1 otherwise. Hence, we have 



E, 



[ n (i- P /)i^] = / 1 e[ n 



Af I a P = x 



f£B{e,cr) 



fed s (e) 

n e 

/69s(e) 



A /• I a P = x 



dx 



dx 



I n (x(i- Pf )+(i-x))dx 

fed g (e) 

/ II (! _ xp /) dx 



/e%(e) 

The second equality holds since the Af variables, conditional on a e = x, are independent. 

Lemma 7 Let p(r, p max ) = Jq rj(xr, xp max ) dx. For any r, p max > 0, we have 
1. p(r, Pmax.) is convex and decreasing on r. 
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2. p (r,p max ) > •(!-(!- Pmax) 1+ ^- J > r 



Proof: To see the first part, let us consider the function values on discrete points r = p ma , x , 2p max , .... Let 
F(x) = Ml — c x ) where c = 1 — p max . From the above derivation, we can easily get that for integral t, 



,Pmax) = / (1 - X Pmax fdx = (1 - C* +1 ) = —F(t + 1). 



The function F(x) is a convex function for any < c < 1. Indeed, it is not hard to prove that -$-jF(x) = 
J? + ° X + ~ ^) > for any < c < 1. However, p (tp max ,p max ) only coincides with 

j^—F(t + 1) at integral values of t. Now, let us consider the value of p(r, p ma , x ) for 7p max < r < 
(7 + l)Pma X : 

P (^,Pmax) = J (1 - SPmax) 7 (l ~ x(r - 7Pmax)) dx (12) 

The key observation is that for fixed values of p max and 7 the right hand side of (fl~2l is a just linear function 
of r. The dependency of p in terms of r then becomes clear: it is a piecewise linear function that takes the 
value F(t + 1) at abscissa points tp m3iX for t G Zo- Therefore, p is a convex decreasing function of r. 
The second part follows easily from Lemma |5J 

p(r,p max )= / rj(xr,xp max )dx > / (1 - xp max ) r/Pmax dx 
Jo Jo 

= — ; (1 - (1 - p max ) 1+ ^] > — ; ( 1 - e" 



Lemma 8 Let e = (u, v) G £7. Lef n be a random matching ordering and a be a random edge ordering of 
the edges adjacent to u and v. Then 

n (i-p/) 1 e] >E a [ n (i- Pf )\E. 

feB{e,n) feB(e,cr) 

Proof: We can think of tt as a permutation of bundles of edges: For each matching, if there are two edges 
incident on e, we bundle the edges together; if there is a single edge incident on e this edge is in a singleton 
bundle by itself. The random edge ordering a can be thought as having all edges incident on e in singleton 
bundles. 

Consider the same random experiment as in Lemma [6] except that we only pick one random number for 
each bundle. Let G(e) be the set of all bundles incident on e. Using the same argument, we have 

®*[ n ( i -pf)\z] = [ 1 n (x.n(i- Pf )+(i-x))dx. 

feB(e,n) J ° geG(e) f€g 

But for any bundle g G G(e) and < x < 1, we claim that 

x-H(l-Pf) + (l-x)>H(l-xp f ). 

feg feg 
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For singleton bundles we actually have equality. For a bundle g = {/i, /2}, we have x{l — — p/ 2 ) + 

(1 - x) = 1 - xp^ - xp /2 + zp/iP/a > 1 - xp/j - xp /a + x 2 p h p h = (1 - zp/JCl - xp/ 2 ). This 
completes the proof. ■ 

As we shall see shortly, if f<^d~{ e ) Ve is small then the probability that e is not blocked is large. Because 
of the marginal distribution property of the GKSP rounding procedure, we can argue that this quantity is 
small in expectation since X^/eS( e ) VeVe < 2 due to the fact that y is a feasible solution to (ILPlb . This, 
however, is not enough; in fact, for our analysis to go through, we need a slightly stronger property. 



Lemma 9 For every edge e, 



E 



Y Pf \ e£E < PfVf- 



/68(e) 



Proof: Let u be an endpoint of e. 

E[ Yl Pf\ e ^ E } = Y PT lVf = 1 I V* = 1 ] 'Pf> 

fed s (u)-e fed(u)-e 

< Y Pr[y/ = i]-P/, 

fed(u)-e 

= Y y f p f- 



[by Theorem g|P3] 
[by Theorem g]Pl]. 



fed(u)-e 

The same bound holds for the other endpoint of e. Adding the two inequalities we get the lemma. ■ 
Everything is in place to derive a bound the expected weight of the matching found by our algorithm. 

Theorem 10 If G is bipartite then ROUND-COLOR-PROBE is a 1 /p(2, p max ) approximation under the edge- 
and matching-probing model, where p is defined in Lemma\7\ The worst ratio is attained at p max = 1> where 
it is 3. The ratio tends to 1 _ 2 e -2 as p max tends to 0. 

Proof: Recall that the optimal value of (ILPlb is exactly YleeE w eUe x e- The expected weight of the matching 
found by the algorithm is 



E [ ALG ] = Y^ w ePe Pr [ e £ E] • Pr [ e was not blocked | e G E ] 
= Y2 w e Pe ?/e • Pr [ e was not blocked | e 6 E ] 

e<=E 

— Ys We Pe 2/e • ^tt [ Yi ( 1 ~Pf)\ e£ ^ 
e&E f&B(e,%) 

> Y W ePeye ' E ° [ H ( l ~ Pf) \ e ^ ^ 

e£E feB{e,a) 

>^^ e p e y e -E p( Y P/'Pmax) \ e £ E 
e&E fe8 s (e) 

> Y WePeVe ■ [ ^2 Pf\ e ^E 

eGE fed s (e) 



[by Theorem |9]P1] 
[by CD] 
[by Lemma [H 
[by Lemma [6l 
[by Jensen's inequality] 
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> ^w e p e y e ■ p( ^ l/fPfiPmax) [by Lemma HI 

>^w e p e y e - p(2,p max ) [y is feasible for (|LP1|)] . 

Notice that we are able to use Jensen's inequality because, as shown in Lemma|71 p(r,p max ) is a convex and 
decreasing function of r. The last inequality also uses the fact that p is decreasing. 

It can be checked directly (using the first inequality in Lemma 02)) that p(2,p max ) is maximized at 
Pmax = 1 where it is 3. Moreover p(2,p max ) ->• (l - e~ 2 ) /2 as p max tends to 0. ■ 



2.3 Weighted Stochastic Matching: General Graphs Redux 

We present an alternative algorithm for weighted stochastic matching in general graphs that builds on the 
algorithm for the bipartite case. The basic idea is to solve (|LPlb . randomly partition the vertices of G into 
two sets A and B, and then run ROUND-COLOR-PROBE on the bipartite graph induced by (A, B). For the 
analysis to go through, it is crucial that we use the already computed fractional solution instead of solving 
again (ILPlb for the new bipartite graph in the call to ROUND-COLOR-PROBE. 

1. (x, y) <— optimal solution to (|LP1|) 

2. randomly partition vertices into A and B 

3. run ROUND-COLOR-PROBE on the bipartite graph and the fractional solution induced by (A, B) 

Theorem 11 For general graphs there is a 2/p(l,p max ) approximation under the edge- and matching- 
probing model, where p is defined in Lemma [7] The worst ratio is attained at p max = X where it is 4. The 
ratio tends to tzt=i as Pmax tends to 0. 

Proof: The analysis is very similar to the bipartite case. Essentially, conditional on a particular outcome for 
the partition (A, B), all the lemmas derived in the previous section hold. In other words, the same derivation 
done in the proof of Theorem [TT] yields: 

E[ALG | (A,B)]> ^2 w e p e y e -p( ^ 

Pf Hf j Pmax 

ee(A,B) fed AtB (e) 

where d A ,B(e) = d(e) n (A, B). 

Hence, the expectation of algorithm's performance is: 

E[ALG] > Y, w ePeVe Pr[e G (A,B)] -e[ P ( ^ PfVfiPn**) I ee (A B ) 



> ^w e p e y e - -p(e[ Pfy.f\ e £( A ' B ) 
ee£ fea A>B (e) 

>2_^W e p e y e --py 2^ — .Pm; 



e&E /S9(e) 
>^2,W e p e y e -- p(l, p m ax), 



e£E 
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where the second inequality follows from Jensen's inequality and the fact that p(r, p max ) is a convex de- 
creasing function of r. Finally, noting that J2e£E w e Ve Ve is a lower bound on the value of the optimal 
strategy, the theorem follows. ■ 



2.4 Unweighted Stochastic Matching 

In this subsection, we consider the unweighted stochastic matching problem, and show that our algorithm 
from £)2.3| can be combined with the natural greedy algorithm Q to obtain a better approximation guarantee 
than either algorithm can achieve on their own. Basically, our algorithm attains its worst ratio when p max is 
large and greedy attains its worst ratio when p max is small. Therefore, we can combine the two algorithms 
as follows: We probe edges using the greedy heuristic until the maximum edge probability in the remaining 
graph is less than a critical value p c , at which point we switch to our algorithm from £)2.3I We denote by 
ALG this combined algorithm and by OPT the optimal probing strategy. 

Lemma 10 Suppose that we use the greedy rule until all remaining edges have probability less than p c , at 
which point we switch to an algorithm with approximation ratio j(p c )- Then the approximation ratio of the 
overall scheme is a(p c ) = max {4 — p c , 7(j> c )}- 

Proof: First, let us review some facts from the work of Chen et al. Q. Let (G, t) be an instance of the 
edge-probing model. Suppose e = (u,v) is the edge with the largest probability. Denote by (Gl^l) and 
(G R , t R ) the instances resulted from the success and failure for the probe to e, respectively. In other words, 
G L = G\{u,v},t L = t and G R = G\ {e},t R {u) = t(u) - 1, t R (v) = t(v) - l,t R (w) = t(w) / 
u,v. Denote the expected value generated by algorithm ALG on instance (G, i) by Ealg(G, t). Suppose 
Pmax > Pc- It is easy to see that, for any ALG that first probes e, 

E A lg(G, t) = Pe + p e E ALG (G L , t L ) + (1 - p e )E ALG (G R , t R ). (13) 

Moreover, Chen et al. showed that 

E pt(G, t) < Pe (A - Pe ) + p e E p T (G L ,t L ) + (1 - Pe )K P T (G R , t R ). (14) 

Now, we prove the theorem by induction on the size (the number of vertices and edges) of the instance. 
The base cases are all instances in which the maximum probability is at most p c . Then Ealg(G, t) < 
7(p c )Eopt(G, t) < a(p c )Kopj(G, t) for any base instance. The inductive step only concerns instances 
where greedy is used. Thus, following from (fT3l and (fT4l and the inductive hypothesis, we get 

E pt(G, t) < p e a(p c ) + p e a(p c )E ALG (G L ,t L ) + (1 - p e )a(p c )E ALG (G R ,t R ) 
< a(p c )E ALG (G,i). 

This completes the inductive proof. ■ 

We are ready to derive the improved bounds for unweighted stochastic matching. 

Proof of Theorem |3j The stated approximation guarantee can be obtained by setting the cut-off point to 
p c = 0.541 and then using Lemma ITOl in combination with Theorem ITTI for bounding the performance of 
the second algorithm at p max = Pc- ■ 

We remark that the approximation ratio of the algorithm in Section |2~T1 does not depend on p max , thus 
we can not combine that algorithm with the greedy algorithm to get a better bound. Furthermore, the result 
of this subsection only holds for the unweighted version of the problem since greedy has an unbounded 
approximation ratio in the weighted case. 
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3 Stochastic Online Matching (Revisited) 



As mentioned in the introduction, the stochastic online matching problem is best imagined as selling a 
finite set of goods to buyers that arrive over time. The input to the problem consists of a bipartite graph 
G = {A, B,Ax B), where A is the set of items that the seller has to offer, with exactly one copy of each 
item, and B is a set of buyer types/profiles. For each buyer type b G B and item a G A, p a b denotes the 
probability that a buyer of type b will like item a, and w a b denotes the revenue obtained if item a is sold to 
a buyer of type b. Each buyer of type b G B also has a patience parameter t& G Z+. There are n buyers 
arriving online, with e& G Z denoting the expected number of buyers of type b, with ^ e& = n. Let P 
denote the induced probability distribution on B by defining Prp [6] = e&/n. All the above information is 
given as input. 

The stochastic online model is the following: At each point in time, a buyer arrives, where her type 
b Gd B is an i.i.d. draw from V. The algorithm now shows her up to distinct items one-by-one: the 
buyer likes each item a £ A shown to her independently with probability p a j. The buyer purchases the first 
item that she is offered and likes; if she buys item a, the revenue accrued is w a b. If she does not like any of 
the items shown, she leaves without buying. The objective is to maximize the expected revenue. 

We get the stochastic online matching problem of Feldman et al. lITOl if we have w a b = p a t, G {0, 1}, 
in which case we need only consider % = 1. Their focus was on beating the 1 — 1/e-competitiveness 
known for worst-case models ifTTl [T6l |2T1 l5l [121: they gave a 0.67-competitive algorithm that works for 
the unweighted case with high probability. On the other hand, our results are for the weighted case (with 
preference-uncertainty and timeouts), but only in expectation. Furthermore, in our extension, due to the 
presence of timeouts (see §1.21 ). any algorithm that provides a guarantee whp must necessarily have a high 
competitive ratio. 

By making copies of buyer types, we may assume that = 1 for all b G B, and T> is uniform over B. 
For a particular run of the algorithm, let B denote the actual set of buyers that arrive during that run. Let 
G = (A, B,A x B), where for each a G A and I ? G B (and suppose its type is some b G B), the probability 
associated with edge (a, b) is p a b and its weight is w a b- Moreover, for each b G B (with type, say, b G B), 
set its patience parameter to tt = We will call this the instance graph; the algorithm sees the vertices of 
B in random order, and has to adaptively find a large matching in G. 

It now seems reasonable that the algorithm of ^2. II should work here. But the algorithm does not know 
G (the actual instantiation of the buyers) up front, it only knows G, and hence some more work is required 
to obtain an algorithm. Further, as was mentioned in the preliminaries, we use OPT to denote the optimal 
adaptive strategy (instead of the optimal offline matching in G as was done in lITOl ). and compare our 
algorithm's performance with this OPT. 

The Linear Program. For a graph H = (A, C,AxC) with each edge (a, c) having a probability p ac and 
weight w ac , and vertices in C having patience parameters tj, consider the LP(if): 

maximize w ac ■ x ac (LP2) 

aeA, c£C 

subject to 

5> ac <l VaeA (15) 

^x ac <l VceC (16) 

aeA 

Y,Vac<t c VceC (17) 

aeA 
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Vae A,ceC (18) 
V«ei,ceC (19) 

Note that this LP is very similar to the one in $2l but the vertices on the left do not have timeout values. 
Let LP(H) denote the optimal value of this LP. 

The algorithm: 

1. Before any buyers arrive, solve the LP on the expected graph G to get values y*. 

2. When any buyer b (of type b) arrives online: 

a. If b is the first buyer of type b, consider the items a G A in u.a.r. order. One by one, offer each 
item a (that is still unsold) to b independently with probability y* b /a; stop if either tj, offers are 
made or b purchases any item. 

b. If b is not the first arrival of type b, do not offer any items to b. 

In the following, we prove that our algorithm achieves a constant approximation to the stochastic online 
matching problem. The first lemma show that the expected value obtained by the best online adaptive 
algorithm is bounded above by E[LP(G)]. 

Lemma 11 The optimal value OPT of the given instance is at most E[LP(G)], where the expectation is 
over the random draws to create G. 

Proof: Consider an algorithm that is allowed to see the instantiation B of the buyers before deciding on the 
selling strategy — the expected revenue of the best such algorithm is clearly an upper bound on OPT. Given 
any instantiation B, the expected revenue of the optimal selling strategy is at most LP(G) (see e.g. Claim[TJ). 
The claim follows by taking an expectation over B. ■ 

The proof of the next lemma is similar to the analysis of Theorem Q] for weighted stochastic matching. 
Lemma 12 Our expected revenue is at least (l — i) ^ (l — ^ — • LP(G). 

Proof: For any buyer- type b G B, in this proof, b refers to the first type- 6 buyer (if any). For each b G B, 
let r.v. Tf, G [n] U {oo} denote the earliest arrival time of a type-6 buyer; if there is no type-6 arrival then 
Tb = oo. Note that our algorithm obtains positive revenue only for buyers {b | b G B, T b < oo}; let i?j 
denote the revenue obtained from buyer b (if any). The expected revenue of the algorithm is E[J2beB Rb\- 
We now estimate E[i?b] for a fixed b G B. 

Let Ab = (Tj, < oo) denote the event that there is some type-6 arrival in the instantiation B. Since each 
arrival is i.i.d. from the uniform distribution over B, Pr[A] = 1 - (1 - l/n) n > 1 - i. In the following, 
we condition on Ab and bound E[i2& | Ab]- Hence we assume that buyer b exists. 

For any vertex a G A, let M a denote the indicator r.v. that a is already matched before time T^; and 
O a (resp. M' a ) the indicator r.v. that b is timed-out (resp. already matched) when item a is considered for 
offering to b. Now, 

Prfitem a offered to b I Ab ] = (1 - Pr[ M a U M' U O a I A b ]) • — 

a 

> (1 - Pr[M a | A b ] - Pr[M' a U O a | A b ]) ■ — (20) 

a 

Claim 2 For any a £ A and b G B, Pr[ M a | Ab ] < ^. 



%ac — Pac ' Vac 

Vac e [o, l] 
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Proof: For any v G B \ {&}, let denote the indicator r.v. for the event T v < T b . We have: 



Pr[ M a I A b ] = Pr[ type-u buyer is matched to a before time T b | A b ] 

v£B\{b} 



(21) 



= Pr t T b \A b ]-Pr[v matched to a\P b \A b ] 



(22) 



v£B\{b} 



(23) 



where the first inequality follows from the fact that even after the algorithm has considered an edge (a, v), 
the probability of matching (o, v) is ^ • p av , the last inequality uses LP-constraint (fT3T ) for graph G, and 
the second last inequality uses Pr[/£ I A b ] < \ (for u G £ \ {6}), which we show next. 

Note that event 1% A .4.;, corresponds to (T v < T b < 00); and event .A;, contains both (T v < T b < 00) 
and (T b <T V < 00). By symmetry, Pr[T„ < T b < 00 ] = Pr[T b < T v < 00], which implies: 



Claim 3 For any a G A and b G -B, Pr[M^ U O a \ A b } < ^ + 

Proof: This is a direct application of Lemmas Q] and [H since items offered to b are considered in u.a.r. order. 
As in E j2.ll there are two cases: 

• Suppose t b = l. Here we have M' a C O a , so Pr[ M„ U O a \ A b ] = Pr[O a \ A b ] < ^, by the proof 
of Lemma Q] using LP-constraint (fTTT ). 

• Suppose tf, > 2. Using the proof of Lemma|2]and LP-constraint (fT71) . we have Pr[O a | ^] < 
Again by the proof of Lemma[T]and LP-constraint (fTBT ). Pr[M^ | A b ] < 

In both cases above, the statement in Claim[3]holds. ■ 
Now applying Claims [2] and [3] to (l20l ). we obtain: 




< 



Pr[r„ < r 6 < 00] 1 



Pr[ (T v < T b < 00) V (T b < T v < 00) ] 2 ' 



This completes the proof of Claim [2 



Prfitem a offered to b \ A b ] > — ( 1 k ) • y ab - 

a \ a 3ar I 





This implies: 



E[R b \A b ] = J2 



w ab • Pab • Pr [ item a offered to b \ A b ] 




Since Pr[ A b ] > 1 - ±, we also have E[R b ] > (1 - 1) I (l 



1 
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Finally, the expected revenue obtained by the algorithm is: 



£.[jy>(i-i)I(i-i-^).LP (0 ). 

fees \ / \ / 

This proves Lemma [121 ■ 

Note that we have shown that E[LP(G)] is an upper bound on OPT, and that we can get a constant frac- 
tion of LP(G). The final lemma relates these two, namely the LP-value of the expected graph G (computed 
in Step 1) to the expected LP-value of the instantiation G; the proof uses a simple but subtle duality-based 
argument. 

Lemma 13 LP(G) > E[LP(G)]. 

Proof: Consider the dual of the linear program (1LP21 ). 

min ^ a a + ^2 ( Qc + tc ' ^ + 5^ Zac ( 24 ^ 

a£A c&C a£A,c£C 

Zac + Vac ■ (Oi a + OL c ) + f3 c > W ac 'Vac Va G A, C G C (25) 

a,/3,z>0 (26) 

Let (a, j3, z) denote the optimal dual solution coiTesponding to graph G; note that its objective value equals 
LP(G) by strong duality For any instantiation G, define dual solution (a, (3, z) as follows: 

• For all a G A, a a = a a . 

• For each c € B (of type b), a c = a b and $ c = Pb- 

• For each a G A and c G B (of type b), z ac = z ab . 

Note that (a, (3, z) is a feasible dual solution coiTesponding to the LP on G: there is constraint for each 
a G A and c G B, which reduces to a constraint for (a, /3, z) in the dual corresponding to G. By weak 
duality, the objective value for (a,/3, z) is an upper-bound on LP(G). For each b G B, let N b denote the 
number of type b buyers in the instantiation B; note that K[N b ] = 1 by definition of distribution V. Then 
the dual objective for (a, (3, z) satisfies: 

Y^a a + Y, N b- (a b + t b -/3 b )+ Yl N b' z ab > LP(G). 

aeA feeS aeA.feeS 

Taking an expectation over B, we obtain: 

E[LP(G)] <J2 a * + J2 E ^ ■ ( ab + tb ■ P* + J2 Zab J 

a&A feeS V a&A J 

= ^2a a + Y2{a b + t b - (3 b ) + ^ = LP(G). 

aeA fees aeA, fees 

This proves the lemma. ■ 
Applying Lemmas ITT1 [121 and [T3l and setting a = ^ - , completes Theorem |4fs proof. 
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4 Stochastic k-Set Packing 



We now consider a generalization of the stochastic matching problem to hypergraphs, where each edge has 
size at most k. Formally, the input to this stochastic k-set packing problem consists of 

• n items/columns, where each item has a random profit Vi S R+, and a random d-dimensional size 
Si G {0, l} d ; these random values and sizes are drawn from a probability distribution specified as part 
of the input. The probability distributions for different items are independent, as are the probability 
distributions for the value and the size for any of the items. Additionally, for each item, there is a set 
Ci of at most k coordinates such that each size vector takes positive values only in these coordinates; 
i.e., Si C d with probability 1 for each item i. 

• A capacity vector b G into which the items must be packed. 

The parameter k is called the column sparsity of the problem. The instantiation of any column (i.e., its 
size and profit) is known only when it is probed. The goal is to compute an adaptive strategy of choosing 
items until there is no more available capacity such that the expectation of the obtained profit is maximized. 

Note that the stochastic matching problem can be modeled as a stochastic 4-set packing problem in the 
following way: we set d = 2n, and associate the i th and (n + i) th coordinate with the vertex i — the first n 
coordinates capture whether the vertex is free or not, and the second n coordinates capture how many probes 
have been made involving that vertex. Now each edge is an item whose value is Wij; if et £ {0, l} d 
denotes the indicator vector with a single 1 in the t th position, then the size of the edge is either 
ej + ej + e n+ i + e n +j (with probability p{) or e n+ i + e n+ j (with probability 1 — p{). If we set the capacity 
vector to be b = (1, 1, • • • , 1, ti,t%, • • • , t n ), this precisely captures the stochastic matching problem. Thus, 
each size vector has < k = 4 ones. 

This stochastic fc-set packing problem was studied (among many others) as the "stochastic 6-matching" 
problem in Dean et al. (H; however the authors of that work did not consider the 'column sparsity' parameter 
k and instead gave an 0(\/5)-approximation algorithm for the general. Here we consider the performance 
of algorithms for this problem specifically as a function of the column sparsity k, and prove Theorem [7] 

A quick aside about "safe" and "unsafe" adaptive policies: a policy is called safe if it can include an 
item only if there is zero probability of violating any capacity constraint. In contrast, an unsafe policy 
may attempt to include an item even if there is non-zero probability of violating capacity — however, if the 
random size of the item causes the capacity to be violated, then no profit is received for the overflowing 
item, and moreover, no further items may be included by the policy. The model in Dean et al. (H allowed 
unsafe policies, whereas we are interested (as in the previous sections) in safe policies. However, due to the 
discreteness of sizes in stochastic k-set packing, it can be shown that our approximation guarantee is relative 
to the optimal unsafe policy. 

For each item i € [n] and constraint j G [d], let := E[5i(j)], the expected value of the j th 

coordinate in size-vector Si. For each column i G [n], the coordinates {j G [d] | f^i(j) > 0} are called 
the support of column i. By column sparsity, the support of each column has size at most k. Also, let 
Wi := E[uj], the mean profit, for each i £ [n]. We now consider the natural LP relaxation for this problem, 
as in H. 

n 

maximize tu, ■ (LP3) 

i=l 

subject to 

n 

Y,IH{j)-Vi<bi Vje[d] (27) 

i=l 
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Vi G [0, 1] Vi e [n] (28) 

Let y* denote an optimal solution to this linear program, which in turn gives us an upper bound on any 
adaptive (safe) strategy. Our rounding algorithm is a natural extension of the one for stochastic matching 
in £12.11 Fix a constant a > 1, to be specified later. The algorithm picks a uniformly random permutation 
tt : [n] — > [n] on all columns, and probes only a subset of the columns as follows. At any point in the 
algorithm, column c is safe iff there is positive residual capacity in all the coordinates in the support of c — 
in other words, irrespective of the instantiation of S c , it can be feasibly packed with the previously chosen 
columns. The algorithm inspects columns in the order of tt, and whenever it is safe to probe the next column 
c € [n], it does so with probability ^. Note that the algorithm skips all columns that are unsafe at the time 
they appear in tt. 

We now prove Theorem[7]by showing that this algorithm is a 2/c-approximation. The analysis proceeds 
similar to that in £12.11 For any column c S [n], let {i c .e}e=i denote the indicator random variables for the 
event that the £ th constraint in the support of c is tight at the time when c is considered under the random 
permutation tt. Note that the event "column c is safe when considered" is precisely f\g =1 I c ,e- By a trivial 
union bound, the Pr[c is safe] > 1 — J2e=i P r [Icd- 

Lemma 14 For any column c £ [n] and index I G [k], Pr[I Cj ^] < 

Proof: Let j € [d] be the £ th constraint in the support of c. Let Ui denote the usage of constraint j, when 
column c is considered (according to tt). Then, using argument similar to those used to prove LemmaQ] we 
have 

n 

E[E/£'] = J^Pr[ column a appears before c AND a is probed ] • /U a (j), 

a=l 

n 

< y^Prf column a appears before c] • — • U a (j), 

a=l 

n 

(2=1 

h 

< —. 

~ 2a 

Since I c j = {U % c > hi}, Markov's inequality implies that Pr[l c /] < E[U£]/bi < ^. ■ 

Again using the trivial union bound, the probability that a particular column c is safe when considered 
under tt is at least 1 — and thus the probability of actually probing c is at least ^(1 — ^-). Finally, by 
linearity of expectations, the expected profit is at least ^(1 — ^-) • J2c=i w £ ' Setting a = k implies an 
expected profit of at least Jj- ■ Y^ c w c y c , which proves Theorem |7J 

5 Final Remarks 

An extended abstract of this paper appeared in the Proceedings of the 18th Annual European Symposium on 
Algorithms (3J- The bounds presented here in £j2]are slightly better than those claimed in the extended ab- 
stract. Quite recently, Adamczyk has proved that the greedy algorithm is a 2-approximation for unweighted 
stochastic matching (TJ, improving our bounds from Theorem [3] It remains an open question whether the 
stochastic matching problem is NP-complete. 



20 



Acknowledgements. We would like to thank Aravind Srinivasan for helpful discussions. 



References 

[1] M. Adamczyk. Greedy algorithm for stochastic matching is a 2-approximation, 2010. 
arXiv: 1007.3036. 

[2] B. Bahmani and M. Kapralov. Improved bounds for online stochastic matching. In ESA, 2010. 

[3] N. Bansal, A. Gupta, J. Li, J. Mestre, V. Nagarajan, and A. Rudra. When LP is the Cure for Your 
Matching Woes: Improved Bounds for Stochastic Matchings. In ESA, 2010. 

[4] S. Bhattacharya, G. Goel, S. Gollapudi, and K. Munagala. Budget constrained auctions with heteroge- 
neous items. In STOC, 2009. Arxiv:abs/0907.4166. 

[5] B. E. Birnbaum and C. Mathieu. On-line bipartite matching made simple. SIGACT News, 39(l):80-87, 
2008. 

[6] R. Carr and S. Vempala. Randomized metarounding. Random Structures Algorithms, 20(3):343-352, 
2002. Probabilistic methods in combinatorial optimization. 

[7] N. Chen, N. Immorlica, A. R. Karlin, M. Mahdian, and A. Rudra. Approximating matches made in 
heaven. In ICALP (1), pages 266-278, 2009. 

[8] B. C. Dean, M. X. Goemans, and J. Vondrak. Adaptivity and approximation for stochastic packing 
problems. In SODA, pages 395^104, 2005. 

[9] B. C. Dean, M. X. Goemans, and J. Vondrak. Approximating the stochastic knapsack problem: the 
benefit of adaptivity. Math. Oper. Res., 33(4):945-964, 2008. 

[10] J. Feldman, A. Mehta, V. S. Mirrokni, and S. Muthukrishnan. Online stochastic matching: Beating 
1 - 1/e. In FOCS, 2009. Arxiv:abs/0905.4100. 

[11] R. Gandhi, S. Khuller, S. Parthasarathy, and A. Srinivasan. Dependent rounding and its applications to 
approximation algorithms. JACM, 53(3):360, 2006. 

[12] G. Goel and A. Mehta. Online budgeted matching in random input models with applications to ad- 
words. In SODA, pages 982-991, 2008. 

[13] S. Guha and K. Munagala. Approximation algorithms for partial-information based stochastic control 
with markovian rewards. In FOCS, pages 483^93, 2007. 

[14] S. Guha and K. Munagala. Multi-armed bandits with metric switching costs. In ICALP, pages 496- 
507, 2009. 

[15] A. Gupta, M. Pal, R. Ravi, and A. Sinha. Boosted sampling: approximation algorithms for stochastic 
optimization. In STOC, pages 417^26. ACM, 2004. 

[16] B. Kalyanasundaram and K. Pruhs. Online weighted matching. /. Algorithms, 14(3):478-488, 1993. 



21 



[17] R. M. Karp, U. V. Vazirani, and V. V. Vazirani. An optimal algorithm for on-line bipartite matching. 
In STOC, pages 352-358, 1990. 

[18] I. Katriel, C. Kenyon-Mathieu, and E. Upfal. Commitment under uncertainty: Two-stage stochastic 
matching problems. Theoretical Computer Science, 408(2-3):2 13-223, 2008. 

[19] M. Mahdian, H. Nazerzadeh, and A. Saberi. Allocating online advertisement space with unreliable 
estimates. In EC, page 294, 2007. 

[20] V. H. Manshadi, S. O. Gharan, and A. Saberi. Online stochastic matching: Online actions based on 
offline statistics, 2010. arXiv: 1007. 1673. 

[21] A. Mehta, A. Saberi, U. V. Vazirani, and V. V. Vazirani. Adwords and generalized on-line matching. 
In FOCS, pages 264-273, 2005. 

[22] A. Schrijver. Combinatorial Optimization. Springer- Verlag, 2003. 

[23] D. Shmoys and C. Swamy. An approximation scheme for stochastic linear programming and its appli- 
cation to stochastic integer programs. /. ACM, 53(6): 1012, 2006. 

[24] C. Swamy and D. Shmoys. Approximation algorithms for 2-stage stochastic optimization problems. 
ACM SIGACT News, 37(1):46, 2006. 

A Cardinality Constrained Multiple Round Stochastic Matching 

We now consider stochastic matching with a different objective in mind; this was also defined in Q. In 
this problem, we arrange for many pairs to date each other simultaneously (constrained by the fact that 
each person is involved in at most one date at any time), and have k days in which all these dates must 
happen — again, we want to maximize the expected weight of the matched pairs. 

More formally, we can probe several edges concurrently — a "round" may involve probing any set of 
edges that forms a matching of size at most C. Given k and C, the goal is to find an adaptive strategy for 
probing edges in rounds such that we use at most k rounds, and maximize the expected weight of matched 
edges during these k rounds. As before, one can probe edges involving individual i at most t{ times, and 
only if i is not already matched by the algorithm. In this section, we give a constant-factor approximation for 
this problem, improving over the previously known 0(min{/c, C}) -approximation Q (which only works 
for the unweighted case). 

Our approach, as in the previous sections, is based on linear programming. The following LP captures 
adaptive strategies, and hence is a relaxation of the multiple round stochastic matching problem; moreover, 
it can be solved in poly-time. Below, Mc(G) denotes the convex hull of all matchings in G having size at 
most C. 

k 

maximize Wij ■ (LP4) 

(i,j)£E h=i 

subject to 

k 
h=l 
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Mi e V (30) 

Vh G [k] (31) 
V(i,j)G£,hG[*] (32) 

Mi eV (33) 

Since there is a linear description for .Mc^G), for which we can separate in polynomial time ll22l 
Corollary 18.10a]), the above LP can be solved in polynomial time using, say, the Ellipsoid algorithm. 
To see that this LP is indeed a relaxation of the original adaptive problem, observe that setting to be 
"probability that (ij) is probed in round h by the optimal strategy" defines a feasible solution to the LP with 
objective equal to the optimal value of the stochastic matching instance. 

Our algorithm first solves the LP to optimality and obtains solution (x, y). Note that for each h G [k], 
using the fact that polytope Mc{G) is integral and that the variables y h G M.c{G), we can write y h as 
a convex combination of matchings of size at most C; i.e., we can find matchings {M^}^ and positive 
values {\p}e such that each Mft is a matching in G of size at most C and y h = ' x(Mg), where 

x(M]?) denotes the characteristic vector corresponding to the edges that the are present in the matching. 
(See, e.g. (H, for a polynomial-time procedure.) Fixing the parameter a to a suitable value to be specified 
later, the algorithm does the following. 



1. for each round h = 1, • • • , k do 
a. define the h th matching 




P h := | 


with probability 1 — ^ 
Mj 1 with probability ^ 


b. Probe all edges in P h that are safe. 





We show that this algorithm is a 20-approximation for a = 10, which proves Theorem[6l 

As before, an edge (i, j) G E is said to be safe iff (a) (i, j) has not been probed earlier, (b) neither i nor 
j is matched, and (c) neither i nor j has timed out. 



Lemma 15 For any edge G E, and at round h G [k], Pr[ is safe in round h] > 1 — — . 

Proof: We will show that the following three statements hold at round h: 

i. Pr[ has probed] < ^. 

ii. Pr[ vertex i is already timed out] < ^. 

iii. Pr[ vertex i is already matched] < — . 
Since Pr[ is not safe in round h ] is at most 

Pr[ been probed] + Pr[i matched] + Pr[i timed out] + Pr[ j matched] + Pr[j timed out] 

by the trivial union bound, proving (i)-(iii) will prove the lemma. To prove (i), observe that for any edge e G 
E and round g, Pr [ e probed in round g ] < Pr [e G P g ] = ^ yi , and hence Pr [ (i, j) probed before round h ] < 
a ^a<h Ve — a' wnere trie Iast inequality uses LP constraint d29l ). 



jea(i) h=i 

y h G Mc(G) 

x ij — Pij ' Vij 

jed(i) h=i 
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The proof for (iii) is identical, using the LP constraint 03]). The proof for statement (ii) is also similar, 
though one upper bounds the expected value of the number of times the vertex i is probed (in this step one 
needs to use the LP constraints (l30l) ) and then uses Markov inequality. ■ 



Theorem 12 Setting a = 10 gives a 20-approximation for multiple round stochastic matching. 
Proof: Using Lemma [T5l we have for any edge (i, j) £ E and round h £ [k], 

Pr [ (i , j) probed in round h ] = Pr[ safe in round h] ■ Pr[ £ P ft | (i, j) safe in round h 

~\~a)' Pr ^*'^ G ^ ' ^ safeinround M 

1 _5\.Vij 



a 1 a 



where the equality follows from the fact that events £ F h and is safe in round h are independent. 
Thus the expected value accrued by the algorithm is 



k 

Ve'Pe, 



w e ■ ^2 P r [ e probed in round h ] ■ p e > — ( 1 J ■ u; e • 

eSE h=l a ^ °^ eG-B h=l 

which is ^ (l — ^) times the optimal LP-value. Setting a = 10 completes the proof. ■ 

B Unweighted Stochastic Matching: A Greedy Algorithm 

In this section we consider a greedy algorithm for the unweighted stochastic matching problem: in this 
unweighted version, all edges have unit weight, and the goal is to maximize the expected number of matched 
edges. The greedy algorithm was proposed by Chen et al. Q, and they gave an analysis proving it to be 
a 4-approximation; however, the proof was fairly involved. Here, we give a significantly simpler analysis 
showing an approximation guarantee of 5. The greedy algorithm we consider is the following: 

1. Let a denote the ordering of the edges in E by non-increasing p e -values. 

2. Consider the edges e £ E in the order given by a 

a. If edge e is safe then probe it, else do not probe e. 

Recall that an edge is safe if neither of its endpoints have been matched or timed out. Note that the expected 
value of the greedy algorithm is 

ALG = Pr [ e is matched ] = Pr[ e is probed ] • p e . 



B.l The Analysis 

While the algorithm does not have anything to do with the linear programming relaxation we presented 
in the previous section, we will use that LP for our analysis. Consider the optimal LP solution (x*,y*), 
and recall that (x*,y*) satisfy the conditions £[])-©. (Alternatively, use the fractional solution y* := 
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Pr[ e is probed in the optimal strategy ] and x* e := Pr[ e is matched in the optimal strategy ].) For each e = 
6 E, define the following three events: 



M e := either i or j is matched when e is considered in a, 

R e := either i or j has timed out when e is considered in a, and 

B e := M e V R e . 

By the algorithm, it follows that Pr [ e is probed ] = 1 — Pr[B e ] for all e 6 £. So, 

ALG = ]T(1 - Pr[B e ]) Pe > - Pr[B e ]) ■ vlPe (34) 

e£E e£E 

The following two lemmas charge the value accrued by the algorithm in two different ways to the optimal 
LP solution. 

Lemma 16 2ALG > Z 9 eE Pr I M 9 ] ■ V* g ■ Pg- 

Proof: In the greedy algorithm, whenever edge e = (i, j) gets matched, write value of f„ f on each edge 
/ £ d(i) (J d(j). Note that the total value written when edge e = (£, j) gets matched is at most: 

x - y* f Pf x - 1 * 1 * -, 

E -V + E ^ = 2 E */ + 2 E 

/6fl(i) /eS(j) /6fl(i) /efl(j) 

where the inequality follows from (Q]). Recall that in any possible execution of Greedy, an edge is matched at 
most once. Thus the expected total value written (on all edges) is at most ^ e e£ Pr t e * s matched ] = ALG. 

On the other hand, whenever event M g occurs in the greedy algorithm (at some edge g = (a, b) £ E), 
read VmIi. value from g. Consider any outcome where event M g occurs: it must be that either a or b was 

already matched (say via edge e); this in turn means that ^^2. value was written on edge g at the time when 
e got matched (since g is adjacent to e). Thus the value read from an edge (at any point) is at most the 

value already written on it. Thus the expected total value read from all edges is Ylg&E FM-^sl ' — 
E [total value written] < ALG. ■ 



Lemma 17 2ALG > E g& E ^[Rg] ■ V* g ' Pg- 

Proof: Consider the execution of the greedy algorithm, with a value a e denned on each edge e € E 
(initialized to zero). Whenever an edge e = gets probed, do (where a e denotes the edges in E that 
appear after e in a): 

1. For each / G d(i) n a e , increase a/ by -^r- 

2. For each / £ d(j) n a e , increase aj by -4^— . 

Let A := YleeE a e- Note tnat tne mcreas e in A when edge e = gets probed is: 

E E 4r^(f E ^ E rfW 
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where for the first inequality we use the greedy property that p e > pj for all f & a e and the sec- 
ond inequality follows from ©. Thus the expected value of A at the end of the greedy algorithm is 
E[^4 at the end of Greedy] < ^ egE Pr[e is probed] • p e = ALG. (Recall that in any possible execution 
of Greedy, an edge is probed at most once.) 

On the other hand, whenever event R g occurs in the greedy algorithm (at some edge g = (a, b) E E), 
read the value a g from g. Consider any outcome where event R g occurs: it must be that either a or b 
was already timed out (say vertex a). This means that t a edges from d{a) have already been probed. By the 

updates to o-values defined above, since g is adjacent to each edge in d(a), the current value a g > t a -^r- = 
y*p g /2. So whenever R g occurs, the value read a g > y*p g /2. I.e. the expected total value read is at least 

YlgeE ^^[Rg] ' ^TT 2 "- However, the total value read is at most the value A at the end of the greedy algorithm. 

This implies that Y, g eE Fl i R !j} ' ^ E [total value read] < E[A at the end of Greedy] < ALG. ■ 

Proof of Theorem [5} Adding the expressions from Lemmas [16] and [T7J we get 

4 ALG > ^2(PT[M e }+PT[R e ])-y* ePe > ^Pr[£? e ] • y* ePe , 

where the second inequality uses the definition B e = M e V R e . Adding this to 04l ), we obtain 5 ALG > 
YleeE Ve ' Ve, which is the optimal LP objective. Thus, the greedy algorithm is a 5 -approximation. ■ 
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